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to reliable downlink transmission in multi-cell multiuser networks. Harnessing such interfer- 
ences largely hinges on two levels of information exchange in the network: one from the users 
to the base-stations (feedback) and the other one among the base-stations (cooperation). We 
demonstrate that exchanging a finite number of bits across the network, in the form of feedback 



■ and cooperation, is adequate for achieving the optimal capacity scaling. We also show that the 

average level of information exchange is independent of the number of users in the network. 
' This level of information exchange is considerably less than that required by the existing coor- 

>• ! 

dination strategies which necessitate exchanging infinite bits across the network for achieving 
the optimal sum-rate capacity scaling. The results provided rely on a constructive proof. 

ON 

cn 

O ■ 1 Introduction 

Multiple-antenna broadcast transmission is known to offer substantial improvement in spectral 
efficiency via spatial multiplexing. Such gains depend heavily on the availability of some channel 
\ state information (CSI) at the transmitter. Acquiring such CSI allows for managing the intra-cell 

interference which is due to spatial multiplexing. There exists a rich literature on the design of 
user scheduling that spans a wide range of transceiver designs with different levels of complexity 
and CSI feedback requirements [HOE]. 

In multi-cell multiuser networks, besides the intra-cell interference caused by the residual in- 
terference remaining after spatial multiplexing within each cell, another impediment arises from 
inter-cell interference due to the ever-shrinking cell sizes, which makes the networks operate in 
interference-limited, rather than noise-limited, regimes. Alleviating the effects of inter-cell interfer- 
ence requires the base-stations to cooperatively adjust their transmissions [UE1E1E]- Coordination 
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among the base-stations for downlink transmission imposes some information exchange (coopera- 
tion) among the base-stations. Once full CSI feedback and full cooperation is viable, i.e., all the 
users report their CSI to all base-stations, and all base-stations share their information streams, 
the multi-cell network can be essentially modeled as a single-cell multiple-input multiple-output 
(MIMO) broadcast channel with a transmit antenna array of a higher dimension. Using the liter- 
ature on MIMO broadcast channel immediately establishes the optimal sum-rate capacity scaling 
of multi-cell multiuser networks which is achievable via dirty-paper coding [3]. 

Such optimal capacity scaling, however, necessitates full feedback and cooperation across the 
network. This requirement, specially in large networks, can potentially overburden the feedback 
channels as well as the backbone channels that connect the base-stations. Therefore, it becomes 
imperative to study whether such full information exchange is all necessary or contains some re- 
dundancy that can be avoided. The question we are specifically trying to address is: what is 
the minimum amount of information exchange, in the form of either CSI feedback or base-station 
cooperation, that ensures retaining the optimal sum-rate capacity scaling? 

We consider a multi-cell network model, in which every few neighboring cells are collectively 
regarded as a super-cell that serve the users within that super-cell collaboratively. The analy- 
sis demonstrates that for the purpose of achieving the optimal scaling, cooperation among the 
super-cells, while desirable, is not necessary. In other words, while collective transmission by the 
base-stations across the super-cells has certain advantages, as long as achieving the optimal capac- 
ity scaling is concerned, the super-cells can decide about their scheduling policies independently. 
Furthermore, each user has to provide feedback to only the base-stations of its designated super- 
cell. These two relaxations essentially imply that base-stations of each super-cell collect only local 
information and any information exchange across super-cells are avoided. We also quantify the 
aggregate amount of information exchange necessary for achieving the optimal sum-rate capacity 
scaling. Specifically, we show that when the network contains M super-cells, each consisting of 
Q base-stations and when each base-station is equipped with Nt transmit antennas, the minimum 
aggregate amount of information exchange in the network to guarantee optimal capacity scaling is 
upper bounded by MQNt [log QNt] , which is independent of the number of users in the network. 

These results in this paper are provided through a constructive proof. The flow of the proof is 
as follows. After discussing the network model in Section [2] we describe a scheduling procedure and 
feedback mechanism in Section [3l Next, in Section 2] we show that proposed scheduler yields the 
optimal sum-rate capacity scaling. Then in Section[5]we assess the aggregate amount of information 
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exchange that the proposed scheduler requires. Section [6] addresses some practical issues, e.g., the 
impact of the size of the super-cells and fairness, and finally Section [7] concludes the paper. To 
enhance the flow of the content, most of the proofs are relegated in the appendices. 

We remark that as opposed to multi-cell MIMO networks, there exists a rich literature on single- 
cell downlink transmission that investigate the interplay between the feedback load and broadcast 
sum-rate throughput scaling. One group of studies, c.f. OEKlO], for single-cell systems consider 
feeding back unquantized real values (channel magnitude or SNR, which in theory requires infinite- 
rate feedback. In another line of studies, c.f., [11], finite-rate feedback mechanisms are introduced. 
However, the optimal sum-rate throughput is achievable in these studies when the number of 
feedback bits tends to infinity. The proposed feedback mechanisms in [12} 113]. impose only finite- 
rate feedback loads and achieve the optimal multiuser diversity gain. However, since they do not 
allow for simultaneous transmissions to multiple users, they cannot capture the multiplexing gain. 
Finally, the heuristic feedback mechanisms proposed in [141 [T5] have limited feedback loads for 
which the sum-rate throughput scaling analysis is not provided. As a by-product of our analysis 
we find that in a single-cell downlink transmission it is possible to achieve the optimal capacity 
scaling rate with finite-rate feedback. It is also noteworthy on a different research direction, that 
capacity scaling for some infrastructure-based sensor networks are provided in [16], [17] 

2 System Model 

Consider the downlink transmission in a MIMO network that consists of M super-cells, each com- 
prising of Q regular cells equipped with one base-station. We use the shorthand Q m to refer to the 
m th super-cell and denote its base-stations by B r m , for r G {1, . . . , Q}. Let K denote the number of 
users per super-cell and assume that the base-stations of each super-cell accomplish the downlink 
transmissions collaboratively. Moreover, we assume that all the base-stations within a super-cell 
are fully cooperative, i.e., are connected through a backbone link that allows them to share their 
information streams and to carry out fully coordinated downlink transmissions. The case that such 
cooperation is not viable corresponds to the setting Q = 1. We also use the shorthand J7* to denote 
the k th user in the n th super-cell, respectively, for k G {1, . . . , K} and n G {1, . . . , M}. 

Define H^ r m G (£ NrXNt as the downlink channel from base-station B T m to user U£ and let 
x r m (t) G C^'* 1 denote the signal vector transmitted by base-station B T m at time instance t. The 
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received signal by E/„ at time instance t is therefore given by 

M Q 

»*(*) = E E v-A™ ^ftm^w + **(*) • 

m=l r=l 

where 7n|m G is incorporated to capitalize the combined effect of transmission power and signal 
attenuation (path- loss) between and Ufi- In non-homogenous networks the terms {7n|m} are 
distinct as different users undergo different path-loss and shadowing effects. Also, z* G ^N r xi 
accounts for the additive white Gaussian noise distributed as £A/"(0, Jjv r )- We consider block 
frequency-flat fading channels and assume that the entries of the channel matrix H^ r m are inde- 
pendent and identically distributed as complex Gaussian CJ\f(0, 1). In order to facilitate spatial 
multiplexing and to harness its ensuing interference as well as the inter-cell interference, the base- 
stations employ linear precoding. We define {wj n (£)}^ 1 as the set of linear precoders employed by 
base-station B r m . By further defining x r m (l,t) as the information symbol that B T m transmits along 
the beamforming vector u r m (€), at any time instance t we have 

N t 

x r m (t) = J2x r m (£,t)^ m (£) . (1) 
1=1 

Motivated by refraining from information feedback about channel directions the base-stations ex- 
ploit the notion of orthogonal random beamforming first proposed in [8]. For this purpose, at the be- 
ginning of each block transmission, each base-station generates Nt orthonormal vectors {^m(^)}^ii 
according to an isotropic distribution |18j . The user 17* knows H% r n u r n {£) for i G {1, . . . , N t } and 
r G {1, . . . , Q} perfectly and instantaneously. We denote the transmission power per information 
stream by p = -j^-, where P is the average power per base-station. Also, throughout the paper, 
we use the convention clk = bx to denote asymptotic equality, i.e., lim^_ i . 00 j£- = 1. Operators < 
and > are defined accordingly. All the logarithms, unless otherwise mentioned, are in base 2 and 
throughput are in bits/sec/Hz. 

3 Scheduling Framework 

We first consider the case that the users have one receive antenna and will later generalize it the 
case of arbitrary number of receive antennas in Section SJ We assume that the users employ single- 
user decoders such that all the inter-cell and intra-cell interferers are treated as Gaussian noise. 
While more advanced receivers will offer better performance, we show that the simple single-user 
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decoders are sufficient to fully capture the optimal multiplexing and multiuser diversity gains in 
MIMO networks. 

Due to full cooperation, the information stream that some base-station in the super-cell Q n 
intends to transmit to user is known to all base-stations of that super-cell. Therefore, the 
message intended for user U% can be transmitted along any arbitrary beamforming vector of any 
base-station of Q n . Hence, user U% can compute the following Q x Nt distinct SINRs given that its 
intended information is embedded in x^(i, t) and transmitted by the base-station B J m along u r n (£), 
where r € {1, . . . , Q} and £ G {1, . . . , N t }, 

S\NR k (r£) = 7n,n|JJ n ,n^nWI (2) 

It can be readily verified that for any given £ and for any (k,n,r) ^ (k',n',r'), the numerators 
and denumerators of SINR^(r, £) and SINR^(r',^) are disjoint. In other words, any term of the 
form H% r m u r n (£) that appears in the characterization of SINR^(r, £) does not appear in that of 
SINR^/(r', £). By invoking the statistical independence of the channel coefficients and beamform- 
ing vectors, we find that SINR^(r, £) and SINR^/(r',^) are also statistically independent for any 
(k,n,r) (k',n',r'). However, by noting the different attenuation factors {7n|m} for different 
choices of k,n,r, SINR^(r, £) and SINR^/(r',^) are not identically distributed. Similarly it can be 
shown that for any fixed set of k, n, r the terms {SINR^(r, £)} are statistically identical but not inde- 
pendent. We denote the cumulative distribution function (CDF) of SINR^(r, £) for £ € {1, . . . , Nt} 
by 

F n fe (x;r):M + ^[0,l] . 

We propose a user scheduling procedure that aims at identifying and serving QN users in each 
super-cell such that 1) the network sum-rate throughput captures the full multiuser diversity gain, 2) 
does not impose any information exchange or cooperation among the super-cells, and 3) necessitates 
each base-station to acquire only a finite number of feedback bits from each of its designated users. 
The steps involved in the proposed scheduling procedure are summarized as follows. We remark 
that the notion of random beamforming, first deployed in [8], has been the core of many scheduling 
schemes for single-cell broadcast channels. 

1) Normalizing SINRs: For each user we define Q positive scalars {/3^(1), . . . , /3^(Q)} each 
pertaining to one of the base-station of the super-cell Q n . 

Vr € {!,..., Q} : define /3*(r) such that F n fc (/3*(r); r) = 1 - 1 . (3) 



There exist unique solutions for /3%(r) as F%(x; r) is a CDF and strictly monotonic in x. Next, 
corresponding to each user U% we define the following normalized SINR terms. 

s\m k n (r,£)^ "\' ] , Vr€{l,...,Q} and £ G {1, . . . , N t } . (4) 

2) Feedback: 

1. Recall that each user U% can be served along any beamformer of the super-cell Q n . 
Among all beamformers, we identify the most favorable beam for the user U k , that is 
the beamformer yielding the highest SINR for U k , i.e., 

sinr^ = maxsinr^(r, £) . (5) 

r, I 

2. If sinr,^ > 1, then user £7* feeds back the index of its most favorable beam to the base- 
stations of the super-cell Q n . As a super-cell has QNt beamformers, conveying such 
feedback requires [log QNt] information bits. 



3) User Selection: 

1. Based on the information fed back by the users, corresponding to each super-cell Q n 
we construct the sets "W n {l\ for r G {1, . . . , Q} and £ G {1, . . . , Nt}, such that 'W n (€) 
contains the indices of the users in Q n who have declared u r n (I) as their most favorable 
beam. By invoking (JSJ we have 

H r n (£) = \k | imr^ > 1 and imr^ = sinr^(r,^)} . (6) 

2. Super-cell Q n randomly selects one user from each set % r n (£) for r G {1,...,Q} and 
£ G {1, . . . , Nt} and embeds the information signal intended for this user in the stream 
x r n {£). Note that the sets {H r n {£)}r,i are mutually disjoint and as a result, it is guaranteed 
that a user will not be served by more than one beam. 

This procedure involves information exchange only in the form of feedback from the users to their 
designated base-stations and the super-cells do not cooperate or exchange any information. The 
crucial part of such scheduling is the appropriate design of the normalization factors /^(r) that 
results in a user selection yielding the optimal capacity scaling. The choices of the normalization 
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factors as characterized in (J3j) serve a two-fold purpose, namely they control the aggregate amount 
of feedback and maintain fairness. As the network size K increases, the normalization factors /^(r) 
as defined in ([3]) affect the feedback load in two ways. On one hand more users are competing for 
accessing the resources and it is expected that more users participate in the feedback process. On the 
other hand, as K increases the normalization factors increase, which makes it harder for individual 
users in a network to satisfy the conditions for feeding back information to their designated base 
stations. As shown later in the paper, the combined impact of these two effects is in favor of 
reducing the feedback load. Also, as for the fairness, incorporating the statistics of the SINRs in 
the choices of the normalization factors ensures that all the users are equiprobable in satisfying their 
feedback constraint and consequently by invoking the selection procedure, all are equiprobable in 
having access to the resources. The proposed scheduling mechanism requires the users acquire the 
SINRs yielded by different beamformers. Such SINR acquisition can be accomplished during the 
training phase when the users obtain the channel states prior to the downlink transmission of data. 

4 Throughput Scaling Analysis 

When in a MIMO network perfect cooperation (full information exchange) among all super-cells is 
allowed, it can be effectively modeled as a large single-cell multiple-antenna broadcast channel with 
MQNt transmit antennas and MK users. In [3l [8] it shown that for such downlink transmission 
the sum-rate capacity, which is achievable via dirty-paper coding, scales with increasing K as 

E H [R] = MQN t log log KN r . (7) 

In the sequel we analyze the sum-rate throughput scaling of our proposed procedure and demon- 
strate that the achievable throughput scales similar to ([7]). 

4.1 Throughput Scaling for N r = 1 

By defining R n as the sum-rate throughput achieved for the n th super-cell, the expected sum- 
rate throughput of the network is Ejj[i?] = X)n=i ^h[Rti\- We further define R n {() as the rate 
supported through the transmission along the beamforming vector u r n (£). As formalized in ([6]), the 
super-cell Q n constructs the new set H n , as the set of the user index sets T-L n (£) f° r an r an d ^> 
i.e., T~L n = {Unity} r ,e- Conditioned on the sets H n , the sum-rate throughput supported in the n th 
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super-cell is the summation of the rates its individual beamformers can sustain. Therefore, 

Q N t 

^h[Ru I U n \ = EE E »K0 | HUt)} . (8) 

r=l 1=1 

As designated by the user scheduling procedure, for each r G {1, . . . , Q} and £ € {1, . . . , iVt}, one 
of the members of the set "H^(^) is opted randomly to be served along the beamforming vector 
u r n {l). Hence, each user with its index in H r n (l) will be selected to be served along the beamformer 
w r n {&) with probability p^Fpyy- As a result given the set of candidate users rl r n {tj, the expected 
value of the rate sustained by the beamformer u) r n {£) is equal to the arithmetic average of the rates 
these candidate users can sustain. By noting that if the transmission of the beamformer u>^(^) is 
intended for user U% then R r n {£) = log(l + SINR^(r,£)), we obtain 

%KWI« = ^tL E E„ [log(l + SINRj;M)) \H r n {£)]. (9) 

The next lemma, which provides a lower bound on Ejj[i?^(-£) | Ti^i)] is instrumental to analyzing 
the scaling rate of K H [R r n (£) | U r n {l)]. 

Lemma 1 (Conditional Expected Rates) If P%(r) > 1 Vn G {1,...,M}, \/k G 
anc? Vr G {1, . . . , Q} ; then 

EhKWIKW] > W.IKWI) = y^yf E Eh [log(l + SINR(f)(r^))] , (10) 

where S\NR^\r,£) denotes the j th largest element of the set {SINR^r, £), . . . , SINRf (r,£)}. 

Proof: See Appendix [A] ■ 
Equations ([8]) and ([9]) along with Lemma Q] (conditional expected rates) establish that if Vfc, n, r 
the normalizing factors satisfy the constraints f3*(r) > 1, then we have 

Q N t 

H n r,l 

Note that the rates TZn(£, |%nWI) are identical for all permutations of the set H^l) that have the 
same cardinality. In other words, TZ r n {l^\H r n {t)\) depends on Tl r n (t) only through its cardinality 
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which can be any member of {0, . . . ,K}. Note that (jlip is a weighted sum of the rates lZ r n {£,b) 
for i £ {1, . . . , Nt}, r £ {1, . . . , Q}, and b £ {1, . . . , K}, and in order to further simplify (jlip we 
need to find their weighting coefficients. For this purpose we rearrange all the relevant terms that 
contribute to 7U n (£, k) and find that 



coefficient of TZ r n (£, b) = E = P (\*W)\ = b) 

{H n | \H- n (l)\=b} 



(12) 



Therefore, combining (|TTj) and (fl~2j) yields that if \/k,n,r the constraints /3^(r) > 1 are satisfied, 
then 



%[«n] > ^TP(\-H r n (l)\ = b)ll r n (£,b) 

E fU ""^ l=P; X)EH [log(l + SINRW (r ,/)) 



r, t, b 

^ P(Kffl|_= &) 

r, L b 



A 



E Eh [log(l + SINR^(r,£))jE 



E <&M)E«[tog(l + SINR£>(r,*) 



= 



(13) 



(14) 



By setting g°(r, £) = P(\H r n {£)\ = 0) we find that for any fixed n, r, £ the sequence {qn(r, £),..., q% (r, £)} 
constitutes a probability mass function (PMF) as 

K K K 



E «iM=«SM+EE 



P(Kffl| = fc) 
it 



4 + E r E P(\K®\ =k) = j^P{\H 



k 

k=l 3=1 



k) = 1 . 



fc=0 



Now, we assess the scaling behavior of R n which in turn provides a lower bound on the throughput 
scaling of our proposed scheduling procedure. For this purpose, based on (|14|) . we need to obtain the 
distributions of order statistics {SINR^(r,^),.. . , SINR^ } (r, £)} for n £ {1,. . . ,M}, r e {1,... ,Q}, 
and £ £ {1, . . . ,iV t }. As mentioned earlier the SINR terms {SINR^r, f), . . . ,SINRf (r,£)} are inde- 
pendent but not identical and thereof characterizing the distributions of their associated sequence 
of order statistics does not seem tractable. For tractability purposes, for each n £ {1,...,M}, 
r £ {1, . . . , Q}, and £ £ {1, . . . , Nt} we construct two other sets one containing lower bounds and 
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the other containing upper bounds on {SINR* (r, £),... ,SINR*(r,£)}. We define 

Cn = m j n 7n!m and ri l n = mhi7^ , 

m,k,r k,r 

Cn = maxTnlm and if n = max 7^ . 

m,k,r k,r 

Based on these we define the following lower bound on SINR^(r, I) 

3 lM = 7T7 l^'>nffl| 2 (15) 

& (E^ l^>^')l 2 - |i^;>^)l 2 ) + ii 



and the following upper bound on SINR*(^) 

T k (r £) = \ H n!u<(£)\ 2 (lg) 

I l^£<WI 2 - |J^;>^)| 2 ) + ^ ' 

Next we show that S k (r, £) and (r, and their associated order statistics have tractable distri- 
butions. 

Lemma 2 (SINR Bounds Distributions) For any given n G {1, . . . , M}, r G {1, . . . , Q}, and 

£ G {1, . . . ,Nt}, the elements of the set {S^(r,£)}^ =1 are i.i.d. with CDF 



SH(r,e) ~ F 1 (x;n) ± 1 - - C , (17) 

x + 1 



and t/ie elements of the set {T^(r,£)}^ =1 are i.i.d. with CDF 



T n(rJ) ~ F 2 (x;n) = 1 - — . (18) 



% x + 1) 



Proof: See Appendix [Bj ■ 
Assessing the scaling of -R n , defined in (I14p (which is a lower bound for E/f[i? n ]) requires the 
knowledge of the distributions of the order statistics {SINR^(r,£), . . . , SINR^^r, £)}. In the next 
lemma it is shown how SINR^ (r,£) is related to the order statistics of the sets {S^(r,£)}^ =1 and 
{T%(r,£)}% =1 . We denote the j th largest elements of the sets {S*(r,£)}% =l and {T%(r,£)}% =1 by 
S { J\r,£) and T#\r,£), respectively. 

Lemma 3 (Ordered SINR Bounds) For anyn G {1, . . . ,M}, r G {1, . . . , Q}, and I G {1, . . . , N t }, 
we have 

Vj G {1,...,K} : S#>M) < S\NR^(r,£)<T^(r,e) . 
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Proof: See Appendix O ■ 
By invoking Lemma [3] (ordered SINR bounds), from (|14p we find that if V/c, n, /3(n,k) > 1, then we 
have following bounds on R n . 

ql(r, £) E H [log (l + StiXrJ))] < R n < ]T <&M) E H [log (l + T^(r, £))] . (19) 

By denoting the CDFs of Sn\r,£) and Tn\r,£) by F^\x;n) and F^^x;?!), respectively, (fT9|) can 
be restated as 

poo / K \ / K 

V / log(l + *) d V^(r,£)if } (x;n) < i? n < V / log(l + x) d V F® (x; n) . 

r,i J ° S=l ' r,i J ° S=l ' 

(20) 

Note that 

K K 

{qi(r,£)}j is a PMF J^M) F^fon) and ^g£(r,i) F 2 (i) (z;n) are CDFs , 

J'=l 3=1 

and thereof we define the new CDFs 

K K 

Ff(x;n,r,£) =J2<f n (r,£) F?\x;n) and F 2 K (x;n,r,£) ± J>£(r,£) F 2 (i) (x;n) . 
Hence, we can further reformat (1201) as 



/"00 pOO 

\og(l + x)dF^(x;n,r,£) < R n < ^ log(l + x) dF 2 K (x;n,r,£) . (21) 



r, £ 1/0 r, I 

In the next step, we define an auxiliary random variable X distributed exponentially with unit 
mean and denote its CDF by 

G{x) = 1 - e~ x . 

Also let G^'{x) denote the CDF of the j th order statistic of a set of statistical samples with K 
members with the parent distribution G(x). Similar to F^ (x;n,£) and F 2 K (x;n, £) define 

K 

G K (x;n,r,£)^^2qi(r,£) G^{x) . 
3=1 

The motivation for defining this auxiliary exponential random variable is to ultimately obtain lower 
and upper bounds on R n that can be characterized as functions of G(x). Such characterization 
is instrumental in the sense that we will be able to use some well-known properties of the order 
statistics of exponential distribution for finding the scaling behavior of R n . The next lemma 
provides the key step for connecting the bounds given in (|2ip with new bounds that are functions 
of G{x). 
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Lemma 4 (Bounds on R n ) If tftfr) >lVnG {1,... ,M}, Vfc £ {1,... <md W € {1, . . . , Q}, 

i/ien toe /iave 

poo poo 

V/ log(l + /w;ix)dC? A: (x;n,r^)-A n <^ n < V / log(l + prgx) dG K (x;n,rJ) , (22) 
where A n = QN t log ((MQN t - l) P CW n + l) . 

Proof: See Appendix |Pl ■ 
To this end we have established lower and upper bounds on Rn based on the order statistics of the 
exponential distribution. As the final step, we assess the scaling of these bounds. The next lemma 
reveals how the scaling of the bounds on R n provided in Lemma S] (bounds on R n ), pertains to the 
characterization of the CDF G K (x;£), which itself depend on the PMFs {q n (r,i)}j- 

Lemma 5 (Scaling) For a sequence of K i.i.d unit-variance exponentially distributed random 
variables with CDF W(x) and for any arbitrary PMF {Qj}^ = i if the condition 

hm %^ = , (23) 

K— >oo A 

is satisfied, then for any positive real number a € M+ we have 

POO 

/ log(l + ax)dW K (x) = log log K + log a , 
Jo 

where 

K 

W K {x) = J2QjW U) (x) , 



an 



d is the CDF of the j th order statistic of the set of these K random variables. 



Proof: See Appendix lEl ■ 
By using the results of Lemmas [4] (bounds on R n ) and [5] (scaling) we state the main result of this 
paper in the next theorem. 

Theorem 1 (MISO Capacity Scaling) For downlink transmission in a MISO network with M 
super-cells each with Q base-stations, K users per super-cell and Nt transmit antenna per base- 
station, the sum-rate throughput achievable via the proposed scheduling procedure scales as 

Ejj [R] = MQN t log log K . 
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Proof: We start by showing that the choice of Qj = qk(r,£) for j G {1, . . . ,K}, where {q J n (r, £)} 
is defined in (jlip . fulfils the condition (|13j) of Lemma [5] (scaling) for any n G {1,...,M}, r G 
{1, . . . , Q}, and £ G {1, . . . , 2V t }. From ([13]) we have 

E iOi = Ej ^(m) = X>E P(K( fc )l = fc) 

= E^(kwi=*)Ej 

fc=i j=i 

= E^- p (iKWi = ^) 
fc=i 

= 2E k p WnW\ = k) + 2E p (KWI = *) - 2 P (IKWI = o) 

fc=l A;=0 



<ln\n r n m + l- (24) 

Note that the probability that each individual user in the n th super-cell expresses interest in the I th 
beam of B r n , i.e., w r n {£), by feeding back its index is -g. As shown at the beginning of Appendix lAl 
we have 

if/8*(r)>l { SINR*(r,*)>0*(r) & k G U r n {£) }. 

Therefore, 

P(k G KW) = P(siNR*(r,*) > 0*(r)) = 1 - F^S\NR k n (r,£);r) § 1 . 

Since |%^(£)| counts the number of users in the ra t/l super-cell that have expressed interest in 
beamformer u r n (£), we have 

\H r n (£)\ ~ Binomial^, 1/if) E[|7^)|] = if • = 1 . (25) 

ii 



Equations and ([25]) provide that 



Ei^i < 1 =* lim 

^ — ' J K— inn K 



^ = 0. 



Therefore, the necessary condition of Lemma [5] (scaling) given in (|23p is satisfied and we can apply 
Lemma [5] (scaling) on the bounds on R n provided in Lemma U (bounds on R n ). By setting 

W{x) = G{x) ,Vj Qj = qi(r,£), and a = prfc, prf n , 

13 



2.5 







: 


1.5 - 









1 - 


ng fad 


0.5 - 


'n 




-mal: 



10 


Noi 





SNR = 


= 10 dB 


SNR = 


= 5dB 


SNR = 


= 0dB 




Figure 1: Normalizing factors /^(r) versus the number of users per super-cell K, for M = 3, Q = 2 
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according to Lemma [J] (bounds on R n ) we find 

QN t log log K + QN t log ( nrnAT ^Y^-T-rr ) ^ R n < QN t log log K + QN t log(p^) . 

\{MQN t - l)pQvk + 1 / 

The bounds above demonstrate that for fixed M, Nt,p, rj^ and r/^, and for increasing K we have 

J im nAT ^ Rn ^ u = 1 ^ ^ = W log log if . 
K^oo QN t log log K 

Taking into account that the sum-rate throughput cannot scale faster than what is achieved via 
dirty paper coding, i.e., MQN t log log K, the proof is complete. ■ 
All the analyses are constructed on the basis that Vfc, n, r, we have /^(r) > 1. As discussed 
earlier, under this condition, each user will have only one of its normalized SINRs (corresponding to 
different beamformers) greater than 1. Also, as implied throughout the analysis, at the asymptote 
of large K, in each super-cell only QNt users will participate in the feedback processes, which 
is the core reason of having controlled amount of feedback. The condition /3^(r) > 1 can be 
guaranteed in the asymptote of large K by recalling that F^(P^(r);r) = 1 — i. By numerically 
evaluating F^(j3^(r);r) through Monte Carlo realizations, Fig.Q]shows how the normalization factor 
changes with the number of users at SNRs. It is seen that in a network with M = 3 super-cells, 
each containing Q = 2 cells and each base-station equipped with Nt = 2 transmit antennas the 
normalizing factors /3^(r) very quickly exceed 1 (as the requirement of Lemma [T] on the conditional 
expected rates) for different SNR levels 0,5 and 10 dB. The normalization factors are obtained via 
numerically solving (|3|). 
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4.2 Throughput Scaling for N r > 1 

In this section we generalize the result of Theorem Q] (MIS O capacity scaling) to MIMO networks 
with arbitrary number of receive antennas. As stated earlier in ([7]), by employing dirty-paper coding 
and facilitating full CSI feedback, the sum-rate throughput exhibits a double logarithmic growth 
with the number of receive antennas, N r . We show that with slight modification to the scheduling 
procedure provided in Section [3] the same gain can be retained. 

We modify the scheduling procedure such that it allows each user to receive more than one 
information stream across its different receive antennas. In other words, we preclude the receive 
antennas to jointly decode the information streams and consider different receive antennas as sep- 
arate users. As a result, this translates to having effectively KN r users in the network and all the 
analyses provided earlier for K users can be extended for the network with KN r users. It is note- 
worthy that although it might not be the most effective way to exploit the receive antennas, it yet 
achieves the optimal capacity scaling. Indeed, more sophisticated approaches involving joint signal 
decoding by the antenna arrays are expected to achieve the same gain. Based on this discussion, 
we modify the normalization factors /3^(r) such that they satisfy 



and use the same normalization factor for all antennas of the same user. According to the modified 
scheduling algorithm on one hand, each user might be required to feed back the indices of more 
than one beam, which potentially increases the amount of feedback amount per user. On the other 
hand, increasing the normalization factor makes it more stringent for the users to satisfy their 
designated normalization factor constraints and send feedback. The discussions on the aggregate 
amount of feedback and effects of the number of users and number of receive antennas are given in 
Section [5j As a direct result of Theorem [T] (MIS O capacity scaling) we obtain the following scaling 
law for MIMO networks. 

Theorem 2 (MIMO Capacity Scaling) For downlink transmission in a MIMO network with 
M super-cells each with Q base- stations, K users per super-cell, Nf transmit antenna per base- 
station, and N r receive antennas pre user, the sum-rate throughput achievable via the proposed 
scheduling procedure scales as 




(26) 



E H [R] = MQN t log log KN r . 
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Figure 2: Sum-rate throughput versus the number of users per super-cell K, for Q = 2 and /3 = 10 
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da. Facn user has 7V r = 1 receive antenna. 



We remark that in the scheduling procedure we have considered that each user may receive more 
than one information stream. On the other hand, if we restrict each user to receive no more than one 
information stream, the performance will degrade. Indeed, as shown in [8] for single-cell broadcast 
transmissions, even with perfect SINR feedback, the sum-rate throughput scales as Nt log log K 
which does not depend on the number of receive antennas. Therefore, with such restriction, adding 
more receiver antennas offers no asymptotic gain. Simulation results in Fig. [2] depict the sum-rate 
throughput achieved by the proposed scheduling scheme and show that it varies linearly with the 
number of super-cells M and transmit antennas Nt and double logarithmically with the number of 
users K. 

Finally, as a corollary to Theorem [2] we can recover the existing results on the capacity scaling 
of the single-cell broadcast channels. The result provided in the following corollary indicates that 
deploying the proposed scheduling scheme for only one super-cell, requires only finite-rate feedback 
and yet retains the optimal capacity scaling of the broadcast channels with full channel state 
information feedback. Similar to the results for the multi-cell networks, in the following corollary 
we also assume that the number of transmit and receive antennas N,N r are fixed. 

Corollary 1 (Single-cell Capacity Scaling) For a MIMO broadcast channel with Nt transmit 
antenna that serves K users each equipped with N r receive antennas, the sum-rate throughput 
achievable via the proposed scheduling procedure scales as 

®h[R] =N t log log KN r . 
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5 Aggregate Feedback 



So far we have established the sum-rate throughput scaling. In this section we evaluate the expected 
amount of feedback bits imposed by the proposed scheduling procedure. This essentially provides 
an upper bound on the minimum required amount of information exchange in the network. As 
discussed in Section El if a user meets the criterion to be considered as a candidate for being 
selected and scheduled, it will feed back [log QNt] bits per eligible receive antenna, and otherwise 
will remain silent indicating that it is not a candidate for being scheduled. Therefore, each user, 
potentially, might send as many as rnin(iVt, N r ) [log QNt] bits via the feedback channel. This, in 
turn, can potentially give rise to up to K min(A r i, N r ) [log QNt] aggregate feedback bits in each 
super-cell, which can be prohibitive. Therefore, it is imperative to examine and control the burden 
put on the feedback channel. 

Note that the normalization factors /^(r) change adaptively with varying K. Therefore, as 
the number of users per super-cell increases, the chance that each individual user satisfies the 
normalization constraint becomes slimmer. In other words, the choices of /3%(r) guarantee that 
as the network becomes dense, the aggregate feedback load is controlled via enforcing a more 
stringent constraint on each user to utilize the feedback channel. In practice, obtaining /8*(r) 
through solving can be carried out via bi-sectional search. As the CDF is an strictly increasing 
function in its argument and the problem in (j3J) has a unique solution, a simple bi-section search, 
which is computationally efficient, can solve the problem with arbitrary accuracy. In the next 
theorem we quantify the aggregate feedback and show that the expected aggregate feedback is a 
constant independent of the number of users in the network. 

Theorem 3 (Aggregate Feedback) For downlink transmission in a MIMO network with M 
super-cells each with Q base- stations, K users per super-cell, N transmit antenna per base-station, 
and N r receive antennas pre user, by deploying the proposed scheduling procedure, the aggregate 
feedback in the network is MQN t [log QNt] ■ 

Proof: Lets define N = mm(Nt, N r ), which is the maximum number of information streams a 
user can receive. Also let S\NR*(r,£,i) denote the detection SINR of i th receive antenna of user 
when served along the beamforming vector u>^(£). The probability that the i th receive antenna of 
user £/~ is a candidate to be served by one of the QN t beamformers of its designated super-cell, is 
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upper bounded by 



/ SINR (r f i) \ union bound Jl ' / 



Hence, the probability that exactly j < N users satisfy the normalization constraint is upper 
bounded by = (^ r ) / u J (l — ^i) Nr ~^ . Therefore, by denoting the number of feedback bits per user 
by R FB and taking into account that if a user satisfies the normalization constraint, it will feed 
back [log QN t ~] bits, we get 

N N /N\ 

E[R FB ] < \logQN t ] Y,Wj = nog QN t ] £j( r )lJ, j (l-ri Nr - j - 

Hence, the aggregate amount of feedback in the n th super-cell, denoted by E[i?^ B ] in the asymptote 
of large K is 

JV 



E[R F n B ] < lim K\] 0g QN t ]jrj( N r)pi(l-ri 

N fN\ 
= [log Qi^] Vj . r lim 



'N r \ . \KNrl \ KN r 



3=1 yJ/ K 

N 



= QN t \logQN t ] , (27) 

which provides that the expected aggregate amount of feedback bits in the network does not exceed 
MQNt [log QNt] . The last equality holds by noting that for j > 1 the term K j-i — > and the only 
non-zero term corresponds to j = 1 and is equal to QNt [log QNt] . ■ 
The simulation results provided in Fig. [3] quantify the aggregate amount of feedback imposed 
by our scheduling protocol to the feedback channel. As anticipated by the analyses, as the number 
of users increase, the total amount of feedback bits is QNt [log QNt] which is independent of the 
number of users as well as the number of receive antennas. This implies that as the number of users 
increase, only QNt users will participate in feeding back their indices of their desired beams. This 
is due to the fact that high normalizing factors for networks with large number of users, only allow 
the best QNt users to feed back the indices of their corresponding beams. For the simulations we 
consider the settings Q = 1,2 base-stations per super-cell and assume that each user has N r = 3 
receive antennas. 
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6 Practical Issues 
6.1 Size of Super-cells 

The size of the super-cells Q impacts both the scaling rate of the sum-rate throughput (Theorem Q] 
on the MISO capacity scaling) as well as the amount of information exchange in the network (The- 
orem [3] on aggregate feedback). The capacity scaling analysis shows that the sum-rate throughput 
scales as MQNt log log K N r . This is the scaling of a network with a total number of MQ base- 
stations and MK users, which means there exist T = £ users per base-station. When keeping this 
ratio of users per base-station constant, the sum-rate throughput scales as MQNf log log QTN r . 
This scaling essentially implies that as long as the product MQ is a constant, the sum-rate scaling 
remains unchanged. The reason is that for fixed MQ, necessarily M will be a finite integer, based 
on which we have 

.. ®h[R] loglogifiV r 

lim = lim = 1 

K->oo MQN t log log MKN r A'->oo log log MKN r 

which implies that the throughput of the proposed procedure scales identical to the scaling law 
provided in Theorem [2j Therefore, when the total number of base-stations in the network MQ is 
fixed, irrespectively of how we cluster them to construct the super-cells, the scaling law remains 
fixed. On the other hand, from Theorem [T] (MISO capacity scaling), the aggregate amount of 
information exchange is upper bounded by MQNt [log QNt] ■ Once having a fixed number of base- 
stations in the network, i.e., MQ is fixed, the amount of information exchange is clearly minimized 
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figure 4: Ihe impact of super-cell sizes Q on the achievable sum-rate throughput. 

by setting Q = 1. This choice of Q corresponds to the setting that there is no cooperation among 
the base-stations and each base-station acts independently of others. This fact in conjunction with 
the freedom in clustering the network shows that even when the network is completely distributed 
(no base-station cooperation) by exchanging as low as MNt log N-t information bits, still the optimal 
sum-rate throughput scaling is achievable. 

There exists, however, a cost associated with such distributed downlink transmission. While 
the sum-rate throughput of the proposes scheme and that of dirty-paper coding scale identically, 
there exists a gap between them. The distributed processing penalizes the performance by en- 
larging this gap. By increasing the size of the super-cells Q we can reduce this gap, which shows 
that we can trade additional information exchange in favor of increasing the achievable sum-rate 
throughput, which is expected intuitively. The monte-carlo simulation results in Fig. U depicts 
the impact of clustering on the achievable sum-rate throughput. We consider a network contain- 
ing MQ = 6 base-stations and look into the sum-rate of the four possible clustering cases of 
(M, Q) = (6, 1), (3, 2), (2, 3) and (1,6). While all four cases exhibit identical scalings, it is observed 
allowing more level of cooperation among the base-stations, i.e., large clusters sizes Q, improves 
the sum-rate throughput. For the simulations we have considered the two cases of Nt = 1,4 and 
have assumed that the users have A^ r = 3 receive antennas and p = 10 dB. 
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6.2 Fairness 

In a non-homogenous network the reception quality at various users not only is influenced by how 
their channel direction vectors are aligned with the beam direction, but also is impacted by the 
path-loss and the shadowing they experience. As scheduling in our scheme involves opportunistic 
selection of the users who have satisfied a minimum level of reception quality, the system might be 
dominated by the user with the strongest reception qualities. In general, there exist an inherent 
tension between opportunistic user selection and maintaining some notion of fairness among the 
users. Therefore, it is imperative to investigate how such opportunistic user selection affects fairness. 

It is, interestingly, observed that by deploying the proposed scheduling procedure, fairness, in 
the sense that all users are equiprobable to be served by the base-stations, is maintained. Such 
fairness is preserved as we assign different normalization factors to the users with different path-loss 
and shadowing (7^). This argument is formalized in the next theorem. 

Theorem 4 (Fairness) For downlink transmission in a MIMO network with M super-cells each 
with Q base-stations, K users per super-cell, Nt transmit antenna per base-station, and N r receive 
antennas pre user, all users are equally likely to be served by the proposed scheduling procedure. 

Proof: The proof relies on the choices of the normalization factors given in (|26p . Let us define 
SINR^(r, £, i) as received SINR of the i th receive antenna of user U k when served along the beam- 
forming vector u r n (£). By following the same line of argument as in (|25p and taking into account 
(|26p . the probability that the i th receive antenna of user U k is a candidate to be scheduled with 
the beamformer ui T n (€) is 

6 n t P (s\m k n (r,e,i) > /3*(r)) = 1 - F^ k n (r);r) = ^L- , (28) 

which is identical for all users (in the n th super-cell) and all transmit and receive antennas. There- 
fore, the probability that exactly j < min(Nt, N r ) number of the receive antennas of user k being 
eligible to be served by a base-station is 

which is also identical for all users in the n th super-cell. Therefore, all users in each super-cell are 
equally likely to be scheduled for receiving any certain number of information streams. ■ 
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6.3 Training 

As briefly mentioned at the end of Section [3] each user needs to calculate the SINRs that each of the 
beamformers can sustain for carrying the information intended for that user. Acquiring such SINRs 
is viable during the training interval. Such training being carried out on the downlink channel will 
cost downlink bandwidth. Nevertheless, it can be readily shown that such bandwidth loss does not 
penalize the throughput scaling rate. More specifically, due to the broadcast nature of the wireless 
channels, the base-stations will transmit a common training message for all users. The duration of 
such training will be fixed and independent of the number of users in the network. If we denote the 
fixed throughput loss due to such training by 22 tra j n , then the throughput after penalizing it by the 
training bandwidth loss will be Ej/fi?,] — -Rtrain- Strain being fixed provides that for increasing K, 
Ejf [22] — -Rtrain will scale similar to E//[2?]. Therefore, while training incurs a loss on the throughput, 
it will not affect the throughput scaling rate provided in Theorem [2] 

7 Conclusions 

In this paper we have assessed the minimum amount of information exchange required in a multi- 
cell multiuser downlink transmission with base-station cooperation, in order to retain the optimal 
capacity scaling. By providing a constructive proof we have demonstrated that by an appropriate 
design of a feedback mechanism for each super-cell, we can avoid any cross- (super) cellular infor- 
mation exchange and yet achieve the optimal scaling law. More specifically, for a MIMO network 
consisting of M super-cells, each consisting of Q base-stations that carry out the downlink trans- 
mission collaboratively, when each base-station is equipped with Nt transmit antennas, exchanging 
MQNt [log QNt] information bits across the network suffices to guarantee the achievability of the 
optimal scaling of the sum-rate throughput. This result relies on a constructive proof, where we 
offer a scheduling framework that satisfies the information exchange constraints. Furthermore, the 
scheme also provides good performance for practical networks and sustain fairness among the users 
in the network. 
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A Proof of Lemma [T] (Conditional Expected Rates) 

Scheme of the Proof: 



KW = {k\ s\m k n (r,£) > 1} . (29) 

This is due to the fact that when /3^(r) > 1, each user in the super-cell Q n will have a normalized 
SINR greater than 1 for at most one of QNt beamformers {uj r n (£)} rj £, i.e., if a user has a normalized 
SINR > 1 for a beamformer, its normalized SINR for the rest of beamformers is smaller than 1. 
This can be readily verified by noting that when /3^(r) > 1 the we have 

sinr£(r,£)>l i SINR*(r,i) > 0*(r) > 1 i SINR£ (/,/)< 1 V(r',/)^M) 
§ sinr£(rV) < 1 V(r',/) + (r,£) 

Given the above characterization of W n {t) in (|29p . the rest of the proof has three major steps. In 
the first step by using the characterization of 7-L r n {£) and some simple manipulation the conditional 
expected value of the rate Ejj[i?^(£) | T-L r n {£)\ is reformulated. In the second step, we find a lower 
bound on this conditional expected rate. This lower bound itself is also a conditional expected 
rate with a different condition compared with Eff[i?^(i?) | T-i^t)]. Finally, in the third step we 
show how removing these new conditions from the conditional expected rates will provide a new 
lower bound on Ejj [R r n {£) \ T~i r n {£)} which is appropriate related to the order statistics of the set 
{SINRiM),...,SINR*(r,£)}. 



Step 1) Reformulating W. H [R r n {£) \ U 
By recalling ([9]) we have 

1 



%KW I K( 



\u r j 



£ E„ [log (l + SINR* (r,£)) 



(30) 



On the other hand, from (|29p we have 



Vfc € H r n (£) : s\n^(r,£) > 1 and VA;' i U r n {£) : sinr£'(£) < 1 . 



(31) 
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By combining (|30p and f)31j) we find that 



^— ^ E H [log(l + SINR*(r,^)) |Vfc€K(^):sinr*(r^)>l; 



E 



J/ 



log(l + SINR*(7v 



and VA;' £ = sinr^'(^) < 1 

Vken r n (£):S\NR k n (r,t)>tt(r); 



and V/c' ^ : SIN {£) < P% (r 



ok' i 



, (32) 



where in the first equality we have replaced (|3ip into (j30|) and in the second equality we have 
invoked that sinr*(r,i) = Sm ^ l) . 



Step 2) Lower Bound on E H [R r n (£) \ H r n (£)}: 

In the next step we manipulate the conditions of the expected rate given in (|32[) and obtain a 
more tractable lower bound on Ejj[i?^(^) | / Hl l (£)]- The core of such manipulation is given in the 
following lemma. 

Lemma 6 (Conditional Expectation) For a random variable X, increasing function <?(•) and 
real values b > a that belong to the support of X we have 



E 



g(X) | X > b > E g(X) \ X > a 



(33) 



Proof: Define Y = g(X) and denote the CDFs of X and Y by Fx(x) and F y (y), respectively. 



E 



g(X) \ X>b 



y d(F Y i x >b{y)) 



F y)x>i,( u ) du ■ 



On the other hand, by recalling that g(X) is monotonic in X 

P(b<X< g~ l {y)) _ F x {g-\y)) - F x (b) 



(34) 



f y i x> b (y) = P{g{x) <y\x>b) 

F x {g~ 1 (y))-F x {a) 



< 



which yields that 



l-F x (a) 

Fy I x>a{y) , 

Vu 6 [0, 1] : 



P(b < X) 
P(g(X) <y\X>a) 



l-F x (b) 



Fy] x > b (u) > Fy] x > a (u) 



(35) 
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Equations (|M|) and (|35j) provide that 

-I 



E 



sPO I x > b 



> 







F Y I X><» dn 



y | x>a(y)) = E g(X) \ X>a 



Next we construct a hypothetical event under which the users in 'K r n {€) satisfy some other SINR 
constraints. For this purpose, we introduce the set of users % r n {£) corresponding to T-i r n {£) such that 



H r n (£)^{k\S\NR k n (rJ)>min^ n (r)} . 



(36) 



Now consider a hypothetical channel condition for which the event T~L r n {£) includes the same set 
of users as the the event ~}i r n {f) does. Under this hypothetical conditions, i.e., % r n {£) = H^l), by 
recalling the characterization of Mjj [R^(i) | W n {£)] given in (f3"2"j) we set b = (3 k (r) and a = min, j3 l n {r) 
and replace the conditions SINR^(r, £) > f3 k (r) by SINR^(r, £) > minj/3^(r). By noting that b > a 
from Lemma [6] we obtain 



e h [K(£) I K 



GEO i 

> 



E E - 



log 1 + SINR*(r,£) I VA; G H r n (£) : S\NR k (r,£) > min^(r); 



and VA;' £ : SINR£(*) < /3*'(r 



(37) 



Next, by taking into account the statistical independence of SINR^(r;£) and SINR^ (r;£) for A; € 
i-L r n (£) and A;' ^ 7i/^(£), which is due to the statistical independence of channel matrices -tf^'m 
and H^'m, changing the conditions enforced on S\NR k (r;£) for k' ^ ^ n if) does not affect the 
statistics of SINR^(f) for k £ H r n {£). Therefore, slightly changing the conditions SINR^'(£) < /3 k ' (r) 
to SINR^'(i) < mmi/3 l n (r) for all k' (£ H r n (l) does not change the distribution of S\NR k (r,£) and 
(|3"7|) can be equivalently cast as 



EhKW I K 
i 



> 



\w r 



E E - 

fce«£(i) 



log(l + SINR£(r,f)J \ Mke U r n {£) : S\NR k (r,£) > min^(r); 

and VA;' £ K(£) : SINR^) < mm(3 l n (r 



(38) 



The new condition 



VA; € H r n {£) :S\HR k n {r,£)> min/£(r) and VA;' £ : SINR*'(£) < min^r) , 
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implies that under this condition, the set % r n {i) contains the \H r n (l)\ = \H r n (l)\ largest values of the 
set 

{SINR ? \(r,^),...,SINR^(r,£)} , 

and the lower bound on Ejj[i?^(£) | T-L r n {()] given in (|38j) is the arithmetic average of these 
largest values conditioned on the constraint that they are larger than or equal to minj/^(r) and 
the remaining (K — |%„(^)|) SINR values are smaller than minj/?^(r). This interpretation can be 
formalized as follows. 



i Km 

1 \H r n (e)\ 



logfl + SINR^(r,£)) I SINR^(r,£) >mmf3 l n (r) > SINR^ +1 )(r 



(39) 



where we have defined A = \H 



r i 



Step 3) Removing the Condition in (l39|) : 

In the final step, we further simplify the lower bound on Ejj \R r n {^) I ^n(^)] provided in (f39l) . This 
step will be the immediate application of the following lemma. 

Lemma 7 (Conditional Expected Value of the Order Statistics) Consider the set of inde- 
pendent random variables {X\, . . . , Xk} and its corresponding set of order statistics {X^\ . . . , X^ K ^}. 
For any increasing function g(-) and the real value a belonging to the supports of X^ we have 



Vj <l 



E 



g(X^) | X^ <a<X 



(I) 



> E 



(40) 



Proof: In the first step, by following the same line of argument as in the proof of Lemma [6] 
(conditional expectation) it can be readily shown that 



V j < I : E 



| X«+V <a<X 



(l) 



> E 



g(X^) | X^ <a< X® 



In the second step we also prove that 



Vj : E 



l(xW>) | X^+V < a < X® 



> E 



(41) 



(42) 



Equations (|41|) and (|42|) together establish the desired result. For proving (|42|) we start by showing 
that for any real b > a 



Vj : P fx® < b I X( j+ V < a < X®) < P (x& < b 



(43) 
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For this purpose let us define 7Tj = (Xj 15 . . . , Xi K ) as the i th possible ordered permutations of the 
set {Xi, . . . , Xk} for i = 1, . . . ,K !. By averaging over all such permutations we have Vj 

P (X^ < 6 | X^' +1 ) < a < X^) 

= J^P (jW < b | < a < I», . . ,#>) = TTi) P ((XW, . . ,#)) = 7T,) 

i 

= 5^P(X il ,...,X ij _ 1 <+oo and X^.,...,^ <b\ X ij+1 , . . . , X ifC < a < X h , . . . , X^) 



XP((X( 1 ),...,XW)=7T i ) 

P ({a < X h , . . . , < +00} and {a < X i} < b} and {X ij+1 ,...,X iK <b}) 

P({a<X h ,..., Xi.^ < +00} and {a < X { . } and {X ij+1 ,...,X iK <b}) 

xP((X( 1 ),...,xW)=vr i 



E 



^ P (X . < 6) - P (X . < a) / m 

^ "/ >< ff(^ (1) ,...,I (K) ) = TTi 



< 



E 



P(a<Xj 
P < b) - P (Xt. < a) P (Xi. < b) 



pf(X( 1 ),...,xW) = vr, 



P(a<X ij ) 

= ^P(X ] <b)xP((xW,...,X^) = n l 

i 

= p (x® < ft) , 

which establishes the claim in ([4~3|) . By defining — g(X^) and denoting the CDFs of X^) and 
y(j) by P X (j)(x) and F Y u)(y), respectively, from (|4"3j) we immediately have 

(y)<Py(,)(y) VnG [0,1]: F'^ ! x0+1) < a < x0) H > F-JjH . (44) 

Therefore, we get 
E 



lu=0 
Bit /•! 

> 



7-1 

yU) I X(J+ 1 )<a<JCO) 



(tt) 



F-^(u) du= yd(F Y(j) (y)) = E \g(X®) 



'u=0 ^j/ 

that justifies (J32J) , which in turn along with ([4"Ij) concludes the desired result. 
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Now, by using the results of the lemmas above and by setting X^) = SINR„ (r,t) for j € {1, . . . ,K}, 
g(x) = log(l + x), and I = A = \T-C!^(£)\, through invoking (|40p (main result of Lemma|7| we obtain 



Vj < A : E H 



log (l + SINR^(r,£)) I SINR^(r,£) > min^(r) > SINR^+^r 



> E 



H 



log(l + SINR^(r,£) 



Combining (|39p and (|45|) provides that 



EhKW I KW] > 



\n r n (i)\ 



\n r n(t)\ 



i=i 



logfl + SINR^(r,£) 



which is the desired result. 



(45) 



(46) 



B Proof of Lemma 1 (SINR Bounds Distributions) 

Based on these we define the following lower bound on SINR^(r, €) 



S k n (r,£) 



H K u 



I 12 



%(V V, \H k u l \ 2 -\H k a^| 2 ) + A- 



and the following upper bound on SINR^(r, 



I" n,n^n 



I |2 



(47) 



(48) 



Let Y = |i3"^'^cj^(^)| 2 . Since H^u^i) ~ CAf(0, l),Y has exponential distribution with unit vari- 
ance. Also define Z = J2 m J2 r , J2 e , \H^ n u r n (i)\ 2 - \H k >n u> e n \ 2 which is the summation of MQN t - 1 
independent exponentially distributed random variables each with unit variance. Therefore, Z has 
a Gamma(MQNt — 1, 1) distribution. By denoting the probability density functions (PDF) of Z 
and Y by 



My) 



e 2 and fz(z) 



z MQN t -2 e -z 

(MQN t ~ 2)! 



the PDF of T k (r,£) 



Y 



i — , denoted by fr(t) is 



it) = / 
J o 



fo\z(t\ z)f z {z)dz 



V/«7n Tn 



MQN t — 2 e -z 



(MQN t ~ 2)! 



-t + 1 



MQN t 



^ t -f- 1 /"I 

^ 9 + %(MQ^ - 1) 

P»7n % 
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where the last step holds as J °° e u u M = Ml. Some simple manipulations the CDF of T^(r,£) is 
obtained as 



F 2 (x;n)= / f T (t)dt = l 
Jo 



-x/prj. 



2 



•if, 

F\(x\ n) can be found by following the same line of argument. 



(% x+ l)MQN t -l 



C Proof of Lemma 1 (Ordered SINR Bounds) 

By induction we show that for given n, r, £, for all j £ {1, . . . , K}, we have 5i j) (r,£) < SINR^(r,£). 

1) For j = 1 we have 

SINR^M) = maxSINR£(r,£) > max5*(r,€) = Sfi(r,£). 

2) Assumption: For some j = I we have s£°(r,0 < SINR^(r.i). 

3) Claim: For j = I + 1 we show that ^ +1) (r,^) < SINRi' +1) (r,f). 

Based on the definition of S*(r,£), each of the (K - I) terms {SINRi' +1) (r, £), . . . , SINR^ } (r, £)} 
is greater than one corresponding element in the set {S^(r, £),..., (r, £)}. Therefore, there 
cannot be more than I elements in the set {S^(r, £),..., (r, £)} that are all greater than all the 
members of {SINR^ +1 ^(r, £), . . . , SINR„ ^(r, £)}. Now, by contradiction assume that S^ +1 \r,£) > 
S\NRn +1 \r,£). This assumption provides that 

S^(r,£) > S^(r,£) >■■■> S« +1) (r,£) > SINR^M) > S\NR^ 2 \r,£) > SINRW(r,£) , 

which indicates that we have found l + l members of the set {S^(r, £),..., S^(r, £)} that are greater 
than all the (K — I) members of the set {SINRn (r, £),..., SINR«^(r, £)} which contradicts with 
what we found earlier. Hence, we should have S { n +1 \r,£) < SINR^ +1) (r, £). The proof for the 
inequality T^\r,£) > SINR^ (r, £) follows the same line of argument. 



D Proof of Lemma |4] (Bounds on R n ) 

The proof heavily depends on the result of the following lemma. 

Lemma 8 For a real variable x £ [0, 1] and integer variables K and j, < j < K — 1, the function 

i=0 v 1 ' 

is increasing in x. 
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Proof: See Appendix [Fj ■ 

Note that from the properties of the order statistics [19] we know that 

j— i 

F^ix-n) ± Yl ( K ) (Fi(x;n)) K ~ t (l - fifon))' , (49) 
i=0 ^ 1 ' 

F®(x;n) ± Yl ( K ) (F 2 (x;n)) K ~ l (l - F 2 (x;n))' (50) 
i=0 ^ % ' 

G W(x) ± £ (^) (G(x)) K ~\l - G(x)) 1 . (51) 
i=o ^ 1 ' 

Also by referring to the definitions of G{x) and F\{x\ n) we find that 

X --(MQN t -l)(^{l + x) 



g((MQJV,-1)(^(1 + i)) + 4) = 1-exp 



>ln 



^ 1 ~~ 7, \ MQNt — i = F i{x;n) 



1 + ^fx 



By applying Lemma [SJ for j = 1, . . . , K, we have 

-2 



GW((MQiY t -l)(^(l + x)) + -| r ) = /(G((W-l)(^(l + I ))+^) ( j-l 

\ 'In r'ln/ \ 'm r'ln 



> /^i(z;n),i-l 



and consequently, 

G^ ((MQN t - l)(4(l + x)) + £r;n,r,£) = ^ g£(r, ^)G (i) ((MQN t - 1)(4(1 + *)) + £r) 

V 'm F'/n / V 'm r'/n / 

if 

> ^^(r,^F 1 (j) (x;n) = Ff(x;n,r^). (52) 

Define u = F^(x; n, r, £). Since (x; n), j = 1, . . . , if, are increasing functions in x, the function 
F^(x;n,r,£) is also increasing in x and thereof convertible. Therefore, (|52p can be rewritten as 
x = (F^)~ l (u;n,r, £). By noting G K (x) is also invertible we have 

G K f(MQN t - 1)(4(1 + (FfrHw^))) + > U , 

or equivalently 

pC 2 n {MQN t -l) + l)(l + (F«y\u-n,£) \ > 1 + pr 1 1 n {G K )- 1 {u; n, r,£) . (53) 
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du 



du 



Using (|53j) we find 

V / log(l + x) dF«(x; n, r, £) + QN t log (p£(MQN t - 1) + 1 

r,t Jo V 

= Y, I lQ g ( X + (FfrHu; n, r, A du + QN t log (p£{MQN t - 1) + l) J 
r ,e Jo 

> W log(l + ^(G^-Hujn, 

r, < ^ 

/>oo 

= V / log(l + pr^x) dG* (x; n, r, I) . 
r, e Jo 

By substituting (I54p into (I2ip we get the following lower bound on R n 

V f°log(l + Pll x) dG K (x;n,r,l) - QN t log (p£(MQN t - 1) + l) < R n . (55) 
r,e Jo v y 

Also it can be easily verified that Vx £ M, i 7 ^^; n) >G\ ) . Hence, 



' p£(MQN t - 1) + 1 ) ( 1 + (Ff )" 1 (n;n,r,£) 



, r\ I (in 



K 

F 2 K (x; n, r,£) = J2 Qj(r, F ? & n) 
j"=i 

A 



= ^^(r,£)/(F 2 (x;n),j-l) 



j 

p{ K PVn J K PVn 



(54) 



A F K (x"n £) 

By defining = 2 a and following the same lines as above we ge 

PVn 

V/ log(l + x) dF?{x\n,r,t) < V / log(l + p~f 2 x) dG K (x;n,r,£) . (56) 
Inequalities in (j54"j) and (f56|) together give rise to the following upper bound on R n 

/•oo 

Rn<J2 log(l + P72X) dG 7< (x; n, r, £) . (57) 

p, £ ^ 

Combining the lower and upper bounds given in (155p and (|57p establishes the desired result. 
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E Proof of Lemma [5] (Scaling) 

We start by citing the following theorem. 

Theorem 5 [13] Theorem 4] Let {X n }!^ =1 be a family of positive random variables with finite mean 
Hk and variance o~\, also hk — > oo and — > as K — > oo. Then, for all a € M. + we have 

log(l + aX K )] = log (l + aE[X K ] 



E 

Now, for any given number of users A per super-cell, we define a random variable Xk that is 
distributed as Xjc ~ W K (x). Also for j = 1, . . . , K we define 



A 

^0) - 



OO 

X t 







and 4) = 



o 



3 ' \ 2 

/>oo 

and /i X = E[Xat] = x dW K {x) 

Jo 

= £ Qj / » dW&') (x) = ^ Q jm . 

3=1 3=1 

As discussed in [19 ] 112 1 Sec. 4.6], for ordered exponentially distributed random variables Xk we 
have 

o\ < 2 + 2// (1) - , (58) 

and log A + C + 2(A . X + < M(i) < log A + C + ^ , (59) 
where by noting that £ ~ 0.577 is the Euler-Mascheroni constant we have = \ogK. Also 

A' 



M(i) - log f ^jQj J ~ C - 0.5 <fj, K < 
V i= i / 



By taking into account the constraint in (I23p . as A — > 00 



log^EjLiiQj) - C-0.5 



< I-lk < 1. (60) 

log A 

Equations ([59]) and ([60]) together show that 

M(i) = MA- = log A, (61) 
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which also implies that [ik — > oo. Taking into account (|58p and (|6ip we also conclude that 



lim^^oo ^ = and therefore the conditions of Theorem [5] are satisfied. Hence, from Theorem [5] 



log(l + ax) dW K (x) = E [ log(l + aX K )^ = log (l + aE[X K 
= log ( 1 + cluk) = log log A + log(a) . 



F Proof of Lemma 



By the expansion of ( x + (1 — x) ) we have 



A' 



K-(j+l) 



u ' /A\ 

1- ^ . (l_ x )^-V = l-/(l-x,^-j-l), 

4 = ^ Z ' 



where it can be concluded that f'(x,i) = f'(l — x, K — j — 1). So it is sufficient to show that 
f'(x,i) > for x < g and for all j = 1, . . . ,K — 1. For this purpose we consider two cases of 
j< LfJ and j > Lfj- 
Case 1: j < [f J 



i=0 



A" 



jx^-^l-x) 4 



A' 



ix^l-x) 



i-l 



i=0 



A 



E( ■ (A-i^"*"^! -x)*" 1 A(l-x)-z 



(62) 



where it can be shown that for x < i 



A(l - x) - t > A(l - x) - j > K(l - x) = — (1 - 2x) > 0. 

Case 2: j > [f J 

Define a, = 1 — ^(LtJ ~~ *)> wnere <5(") is the Dirac delta function. Therefore, we get 



LfJ 



/(x,j) = /(x,A-j-1)+ E a U • x^(l-x)* + x l (l-x 



i=K-j 



A 



\A- 



33 



For x < \ we get 



fix, 3) = f'(x,K-j-l) 
'K 



+ E 



a, 



--K- 



K-i-1 



(1-x) 



i-1 



K — i — Kx 



+ x l -\l -x) 



K-i-1 



i — Kx 



> x K-t-±(i_ x y-\ 



LfJ 



> f'( x ,K-j-l)+ 



<Lfj 



--K-i 



K 



K-i-1 



(1-x) 



i-1 



K - 2Kx 



>o 



> 



(63) 



From (j62|) and ([63]) it is concluded that for x < i, f(x,j) is an increasing function of x, which 
provides the proof. 
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